Collective Data Mining: A New Perspective Toward Distributed Data Mining
Authors
Abstract
This paper introduces collective data mining (CDM), a new approach to distributed data mining (DDM) from heterogeneous sites. It points out that naive approaches to distributed data analysis in a heterogeneous environment may face ambiguous situations and may lead to an incorrect global data model. It also observes that any function can be expressed in a distributed fashion using a set of appropriate basis functions, and that orthonormal basis functions can be used effectively to develop a general framework for DDM that guarantees correct local analysis, resulting in the desired global data model with minimal data communication. The paper develops the foundation of CDM, discusses decision tree learning and polynomial regression in CDM for discrete and continuous variables, and describes BODHI, a CDM-based experimental system.
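The observation about orthonormal basis functions can be illustrated with a minimal sketch. It is not the paper's algorithm or the BODHI system: it assumes binary features, uses the Walsh (parity) basis as one possible orthonormal basis, and the function `target` and the split into a "site A" are hypothetical. It shows that a function is recovered exactly from its basis coefficients, and that a coefficient whose basis function involves only one site's features can be computed from that site's columns and the labels alone.

```python
import itertools
import numpy as np


def walsh_basis(x, j):
    """Parity basis function psi_j(x) = (-1)^(j . x) for bit tuples x and j."""
    return (-1) ** int(np.dot(x, j))


def walsh_coefficients(f, n_bits):
    """Return {j: w_j} with w_j = (1 / 2^n) * sum_x f(x) * psi_j(x)."""
    points = list(itertools.product([0, 1], repeat=n_bits))
    return {
        j: sum(f(x) * walsh_basis(x, j) for x in points) / len(points)
        for j in points
    }


def reconstruct(coeffs, x):
    """Evaluate f(x) = sum_j w_j * psi_j(x) from the coefficients."""
    return sum(w * walsh_basis(x, j) for j, w in coeffs.items())


def target(x):
    # Hypothetical function over three binary features (illustration only).
    return 2.0 * x[0] - 1.0 * x[2] + 3.0 * x[0] * x[1]


def local_coefficient(local_cols, labels, j_local):
    """Coefficient of a basis function that depends only on local columns."""
    signs = (-1.0) ** (local_cols @ np.array(j_local))
    return float(np.mean(labels * signs))


# Global view: all coefficients, computed from the full feature space.
coeffs = walsh_coefficients(target, 3)

# Assumed vertical split: site A observes only features 0 and 1 plus labels.
points = np.array(list(itertools.product([0, 1], repeat=3)))
labels = np.array([target(x) for x in points])
site_a = points[:, :2]

# The coefficient for the basis function on features {0, 1} needs no data
# from the other site.
w_local = local_coefficient(site_a, labels, (1, 1))
```

Here `w_local` matches the global coefficient `coeffs[(1, 1, 0)]`. Coefficients of basis functions that span features held at different sites cannot be obtained this way and would need some rows to be joined across sites.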
Similar resources
Integration and Interaction of Distributed Data Mining with Agent Technology
In recent years, more and more researchers have been involved in research on both agent technology and distributed data mining. A clear disciplinary effort has been made toward removing the boundary between them, that is, toward the interaction and integration of agent technology and distributed data mining. We refer to this as agent mining, a new area. The marriage of agents and distributed d...
Distributed Data Mining and Agent Mining Interaction and Integration: a Novel Approach
In recent years, more and more researchers have been involved in research on both agent technology and distributed data mining. A clear disciplinary effort has been made toward removing the boundary between them, that is, toward the interaction and integration of agent technology and distributed data mining. We refer to this as agent mining, a new area. The marriage of agents and distributed d...
Interaction and Integration of Agent Mining in Distributed Data Environment
In recent years, more and more researchers have been involved in research on both agent technology and distributed data mining. A clear disciplinary effort has been made toward removing the boundary between them, that is, toward the interaction and integration of agent technology and distributed data mining. We refer to this as agent mining, a new area. The marriage of agents and distributed da...
Clustered Collaborative Filtering Approach for Distributed Data Mining on Electronic Health Records
Distributed Data Mining (DDM) has become one of the promising areas of data mining. DDM techniques include the classifier approach and the agent approach. The classifier approach plays a vital role in mining distributed data, with homogeneous and heterogeneous variants depending on the data sites. The homogeneous classifier approach involves ensemble learning, distributed association rule mining, meta-learning an...